Transparent Parallelization of Binary Code
نویسندگان
چکیده
This paper describes a system that applies automatic parallelization techniques to binary code. The system works by raising raw executable code to an intermediate representation that exhibits all memory accesses and relevant register definitions, but outlines detailed computations that are not relevant for parallelization. It then uses an off-the-shelf polyhedral parallelizer, first applying appropriate enabling transformations if necessary. The last phase lowers the internal representation into a new executable fragment, re-injecting low-level instructions into the transformed code. The system is shown to leverage the power of polyhedral parallelization techniques in the absence of source code, with performance approaching those of source-to-source tools.
منابع مشابه
Automatic Parallelization of Affine Loops using Dependence and Cache analysis in a Binary Rewriter
Title of dissertation: AUTOMATIC PARALLELIZATION OF AFFINE LOOPS USING DEPENDENCE AND CACHE ANALYSIS IN A BINARY REWRITER Aparna Kotha, Doctor of Philosophy, 2013 Dissertation directed by: Professor Rajeev Barua Department of Electrical and Computer Engineering Today, nearly all general-purpose computers are parallel, but nearly all software running on them is serial. Bridging this disconnect b...
متن کاملTowards Transparent Parallelization of Connectionist Systems
Much work has been done in the area of parallel simulation of connectionist systems. However, usually parallel implementation issues for artiicial neural networks have been discussed in general terms, but the actual parallel programs implement speciic network models and are written in programming languages like C or C++. This paper deals with the transparent parallelization of neural networks. ...
متن کاملAffine Parallelization of Loops with Run-Time Dependent Bounds from Binaries
An automatic parallelizer is a tool that converts serial code to parallel code. This is an important tool because most hardware today is parallel and manually rewriting the vast repository of serial code is tedious and error prone. We build an automatic parallelizer for binary code, i.e. a tool which converts a serial binary to a parallel binary. It is important because: (i) most serial legacy ...
متن کاملShared Memory Parallelization of the GROMOS96 Molecular Dynamics Code
This paper describes the parallelization of a commercial molecular dynamics simulation code, GROMOS96, on a SCI (Scalable Coherent Interface) interconnected PC cluster. The underlying programming model is that of shared data structures, exploiting SCI’s capabilities of enabling access to segments of remote memory in an entirely transparent way. Methodologies are elaborated that allow to obtain ...
متن کاملTransparent runtime parallelization of the R scripting language
Scripting languages such as R and Matlab are widely used in scientific data processing. As the data volume and the complexity of analysis tasks both grow, sequential data processing using these tools often becomes the bottleneck in scientific workflows. We describe pR, a runtime framework for automatic and transparent parallelization of the popular R language used in statistical computing. Reco...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011